Modulation enhancement of speech by a pre-processing algorithm for improving intelligibility in reverberant environments
Authors
Abstract
Most listeners have difficulty understanding speech in reverberant conditions. The purpose of this study is to investigate whether an algorithm can reduce the degradation of speech intelligibility caused by reverberation. The modulation spectrum is the spectral representation of the temporal envelope of the speech signal. For clean speech, it is dominated by components between 1 and 16 Hz, centered at 4 Hz, which is the most important range for human perception of speech. In reverberant conditions, the modulation spectrum of speech is shifted toward the lower end of the modulation frequency range. In this study, we propose enhancing the important modulation spectral components before the speech is distorted by reverberation. Word intelligibility in a carrier sentence was tested with the newly developed algorithm, using two different filter designs, in three reverberant conditions. The reverberant speech was simulated by convolving clean speech with impulse responses measured in actual halls. The experimental results show that modulation filtering incorporated into a pre-processing algorithm improves intelligibility for normal-hearing listeners when (1) the modulation filters are optimal for a specific reverberant condition (i.e., T60 = 1.1 s), and (2) consonants are preceded by highly powered segments. Under shorter (0.7 s) and longer (1.6 s) reverberation times, the two modulation filters used in the current experiments, an Empirically-Designed (ED) filter and a Data-Derived (D-D) filter, each caused a slight decrement in performance. The results of this study suggest that further gains in intelligibility may be achieved by redesigning the modulation filters to suit other reverberant conditions.
© 2004 Elsevier B.V. All rights reserved. doi:10.1016/j.specom.2004.06.003
A. Kusumoto, T. Arai, K. Kinoshita, N. Hodoshima, N. Vaughan. Speech Communication 45 (2005) 101–113.
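The two ideas in the abstract, a modulation spectrum taken from the temporal envelope and reverberant speech simulated by convolution with an impulse response, can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and not the authors' implementation: a noise carrier amplitude-modulated at 4 Hz stands in for speech, exponentially decaying noise stands in for a measured hall impulse response, and the function names (`intensity_envelope`, `modulation_spectrum`, `synthetic_rir`, `reverberate`, `demo`) are hypothetical. The sketch only shows how reverberation weakens the 4 Hz modulation; it does not implement the ED or D-D modulation filters.

```python
import numpy as np


def intensity_envelope(x, fs, env_fs=100):
    """Mean-square intensity in consecutive frames, sampled at env_fs Hz."""
    hop = fs // env_fs
    n = len(x) // hop
    return np.mean(x[:n * hop].reshape(n, hop) ** 2, axis=1)


def modulation_spectrum(env, env_fs):
    """Modulation index per modulation frequency: 2|FFT(env)| / sum(env)."""
    freqs = np.fft.rfftfreq(len(env), d=1.0 / env_fs)
    index = 2.0 * np.abs(np.fft.rfft(env)) / np.sum(env)
    return freqs, index


def synthetic_rir(t60, fs, rng):
    """Assumed stand-in for a measured hall response:
    white noise whose amplitude has decayed by 60 dB at t = t60."""
    t = np.arange(int(t60 * fs)) / fs
    return rng.standard_normal(len(t)) * 10.0 ** (-3.0 * t / t60)


def reverberate(x, h):
    """Linear convolution of x with h via FFT, trimmed to len(x)."""
    n = len(x) + len(h) - 1
    y = np.fft.irfft(np.fft.rfft(x, n) * np.fft.rfft(h, n), n)
    return y[:len(x)]


def demo(t60=1.1, f_mod=4.0, fs=8000, dur=4.0, seed=0):
    rng = np.random.default_rng(seed)
    t = np.arange(int(dur * fs)) / fs
    # Noise carrier whose intensity is fully modulated at f_mod Hz.
    clean = np.sqrt(1.0 + np.cos(2 * np.pi * f_mod * t))
    clean = clean * rng.standard_normal(len(t))
    reverberant = reverberate(clean, synthetic_rir(t60, fs, rng))

    result = []
    for signal in (clean, reverberant):
        freqs, index = modulation_spectrum(intensity_envelope(signal, fs), 100)
        # Read the modulation index at the bin closest to f_mod.
        result.append(index[np.argmin(np.abs(freqs - f_mod))])
    return tuple(result)


m_clean, m_rev = demo()
print(f"4 Hz modulation index: clean={m_clean:.2f}, reverberant={m_rev:.2f}")
```

In this toy setup the modulation index at 4 Hz is lower for the reverberant signal than for the clean one, which is the loss the pre-processing described in the abstract tries to compensate in advance. Changing `t60` to 0.7 or 1.6 gives a rough feel for the other two conditions, keeping in mind that the synthetic response is only an approximation of a real hall.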
Similar resources
Intelligibility enhancement of casual speech for reverberant environments inspired by clear speech properties
Clear speech has been shown to have an intelligibility advantage over casual speech in noisy and reverberant environments. This work validates spectral and time domain modifications to increase the intelligibility of casual speech in reverberant environments by compensating particular differences between the two speaking styles. To compensate spectral differences, a frequency-domain filtering a...
Effects of suppressing steady-state portions of speech on intelligibility in reverberant environments
1. Introduction When listening to a lecture in a large auditorium it is often difficult to understand the speech. Among other factors, comprehension may be impaired by reverberation, which is sound reflecting from the wall, interfering with direct sound. Based on the modulation transfer function (MTF), the speech transmission index (STI) has been proposed as an objective measure for speech inte...
Designing modulation filters for improving speech intelligibility in reverberant environments
In this paper, we propose a new technique to design modulation filters to reduce degradation of speech intelligibility in reverberant environments. Using the inverse modulation transfer function, we design data-derived modulation filters for each speech frequency band. These filters preprocess speech signals between a microphone and a loudspeaker that radiates speech into a performance hall. Us...
Temporally Enhanced Speech Is More Intelligible in Reverberant Environments
Reverberation causes degradation in speech comprehension, especially for elderly people, the hearing-impaired and non-native listeners. In order to prevent intelligibility degradation, we developed several pre-processing techniques, where signals are processed before being radiated through the loudspeakers of a public address system. The two main techniques are modulation filtering and steady-s...
Suppressing Steady-state Portions of Speech for Improving Intelligibility in Various Reverberant Environments
In previous studies (Arai et al., 2001; Arai et al., 2002), we hypothesized that segments of an acoustic signal are masked by reverberation components of previous segments, degrading speech intelligibility. To reduce masking influences, we suppressed steady-state portions having more energy, but which are less crucial for speech perception. We have presently conducted a perceptual test with a s...
Journal:
- Speech Communication
Volume 45, Issue
Pages -
Publication date 2005